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Abstract. Grothendieck inequalities are fundamental inequalities which are 
frequently used in many areas of mathematics and computer science. They 
can be interpreted as upper bounds for the integrality gap between two op- 
timization problems: a difficult scmidefinite program with rank-1 constraint 
and its easy semidefinite relaxation where the rank constrained is dropped. 
For instance, the integrality gap of the Goemans-Williamson approximation 
algorithm for MAX CUT can be seen as a Grothendieck inequality. In this 
paper we consider Grothendieck inequalities for ranks greater than 1 and we 
give two applications: approximating ground states in the n- vector model in 
statistical mechanics and XOR games in quantum information theory. 



1. Introduction 

Let G — (V, E) be a graph with finite vertex set V and edge set E C ( ^) ■ Let 
A: V x V — >Rbea symmetric matrix whose rows and columns are indexed by the 
vertex set of G, and r be a positive integer. The graphical Grothendieck problem 
with rank-r constraint is the following optimization problem: 

SDP. r (G,A)=max[ £ A(u,v)f(u) •/(«):/: V -> S^ 1 j, 

where S r ~ 1 = {x £ W : x ■ x = 1} is the (r — l)-dimensional unit sphere. The 
rank-r Grothendieck constant of the graph G is the smallest constant K(r, G) so 
that for all symmetric matrices A: V x V — > R the following inequality holds: 

(1) SDP oc (G,A) <K(r,G)ST>P r {G,A). 

Here S°° denotes the unit sphere of the Hilbert space Z 2 (R) of square summable 
sequences, which contains R™ as the subspace of the first n components. It is easy 
to see that K (r, G) > 1. In this paper, we prove new upper bounds for K(r, G). 
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1.1. Some history. Inequality ([T]) is called a Grothendieck inequality because it 
first appeared in the work [22 of Grothendieck on the metric theory of tensor 
products. More precisely, Grothendieck considered the case r = 1 for 2-chromatic 
(bipartite) graphs, although in quite a different language. (A k-chromatic graph is 
a graph whose chromatic number is k, i.e., one can color its vertices with k colors 
so that adjacent vertices get different colors, but k— 1 colors do not suffice for this.) 
Grothendieck proved that in this case K (1, G) is upper bounded by a constant that 
is independent of the size of G. 

Later, Lindenstrauss and Pelczyhski [33 reformulated Grothendieck's inequality 
for bipartite graphs in a way that is very close to the formulation we gave above. The 
graphical Grothendieck problem with rank-1 constraint was introduced by Alon, 
Makarychev, Makarychev, and Naor [2] . Haagerup [23] considered the complex case 
of Grothendieck's inequality; his upper bound is also valid for the real case r = 2. 
The higher rank case for bipartite graphs was introduced by Briet, Buhrman, and 
Toner [11]. 

1.2. Computational perspective. There has been a recent surge of interest 
in Grothendieck inequalities by the computer science community. The problem 
SDP r (G, A) is a semidefinite maximization problem with rank-r constraint: 



where is the set of matrices X : VxV->I that are positive semidefinite. 

On the one hand, SDP r (G, A) is generally a difficult computational problem. For 
instance, if r = 1 and G is the complete bipartite graph K n>n on 2n nodes, and if A 
is the Laplacian matrix of a graph G' on n nodes, then computing SDPi (K n ^ n , A) 
is equivalent to computing the weight of a maximum cut of G' . The maximum cut 
problem (MAX CUT) is one of Karp's 21 NP-complete problems. On the other 
hand, if we relax the rank-r constraint, then we deal with SDP^G, A), which is an 
easy computational problem: Obviously, one has SDP^G, A) = SDP|y|(G, A) and 
computing SDP|y|(G, A) amounts to solving a semidefinite programming problem 
(see e.g. Vandenberghe, Boyd [47]). Therefore one may approximate it to any 
fixed precision in polynomial time by using the ellipsoid method or interior point 
algorithms. 

In many cases the optimal constant K(r , G) is not known and so one is interested 
in finding upper bounds for K(r,G). Usually, proving an upper bound amounts 
to giving a randomized polynomial-time approximation algorithm for SDP r (G, A). 
In the case of the MAX CUT problem, Goemans and Williamson [21] pioneered 
an approach based on randomized rounding: One rounds an optimal solution 
of SDPqo (G, A) to a feasible solution of SDP r (G, A) . The expected value of the 
rounded solution is then related to the one of the original solution, and this gives 
an upper bound for K(r,G). Using this basic idea, Goemans and Williamson [2Tj 
showed that for all symmetric matrices A: V x V — > R which have the properties 
A(u, v) < for u distinct from v and ^2 u£V A{u, v) = for all v € V, we have 




= max 




X(u, u) = 1 for all u e V , 



VxV 
>0 




SDPootKn.n.A) < (0.878...r 1 SDP 1 (K„,„,A). 
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1.3. Applications and references. Grothendieck's inequality is a fundamental 
inequality in the theory of Banach spaces. Many books on the geometry of Banach 
spaces contain a substantial treatment of the result. We refer for instance to the 
books by Pisier [30], Jameson [53], and Garling |20) . 

During the last years, especially after Alon and Naor [3] pointed out the con- 
nection between the inequality and approximation algorithms using semidefinitc 
programs, Grothendieck's inequality has also become a unifying and fundamental 
tool outside of functional analysis. 

It has applications in optimization (Nesterov [39], Nemirovski, Roos, Terlaky 
[55] . Megretski [35])) extremal combinatorics (Alon, Naor [5]), system theory (Ben- 
Tal, Nemirovski [9]), machine learning (Charikar, Wirth [14], Khot, Naor p5l[26] h 
communication complexity (Linial, Shraibman [32]), quantum information the- 
ory (Tsirel'son [3j}], Regev, Toner [43 ), and computational complexity (Khot, 
O'Donnell [28 , Arora, Berger, Kindler, Safra, Hazan [5J, Khot and Naor [27] , 
Raghavendra, Steurer [41] ). 

The references above mainly deal with the combinatorial rank r = 1 case, when 
S a = {— 1,+1}. For applications in quantum information (Briet, Buhrman, Toner 
[TT] ) and in statistical mechanics (mentioned in Alon, Makarychev, Makarychev, 
Naor [2], Kindler, Naor, Schechtman [29]) the more geometrical case when r > 1 is 
of interest — this case is the subject of this paper. 

Before we present our results we consider the application to statistical mechanics: 
The n-vector model, introduced by Stanley [45) . describes the interaction of particles 
in a spin glass with ferromagnetic and antiferromagnetic interactions. The case 
n = 1 corresponds to the Ising model, the case n = 2 to the XY model, the 
case n — 3 to the Heisenberg model, and the case n — oo to the Berlin-Kac 
spherical model. 

Let G = (V, E) be the interaction graph where the vertices are particles and 
where edges indicate which particles interact. The potential function A: V xV ^ M. 
is if u and v are not adjacent, it is positive if there is ferromagnetic interaction 
between u and v, and it is negative if there is antiferromagnetic interaction. The 
particles possess a vector- valued spin f:V—> S™^ 1 . In the absence of an external 
field, the total energy of the system is given by the Hamiltonian 

H(f) = - A(tt, «)/(«) •/(«). 

The ground state of this model is a configuration of spins / : V — > S 1 ™ -1 which mini- 
mizes the total energy. Finding the ground state is the same as solving SDP„(G, A). 
Typically, the interaction graph has small chromatic number, e.g. the most common 
case is when G is a finite subgraph of the integer lattice U 1 where the vertices are 
the lattice points and where two vertices are connected if their Euclidean distance 
is one. These graphs are bipartite since they can be partitioned into even and odd 
vertices, corresponding to the parity of the sum of the coordinates. 

We briefly describe the relation to quantum information theory. In an influential 
paper, Einstein, Podolsky, and Rosen [18] pointed out an anomaly of quantum me- 
chanics that allows spatially separated parties to establish peculiar correlations by 
each performing measurements on a private quantum system: entanglement. Later, 
Bell [8] proved that local measurements on a pair of spatially separated, entangled 
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quantum systems, can give rise to joint probability distributions of measurement 
outcomes that violate certain inequalities (now called Bell inequalities) , satisfied by 
any classical distribution. Experimental results of Aspect, Grangier, and Roger [5] 
give strong evidence that nature indeed allows distant physical systems to be cor- 
related in such non-classical ways. 

XOR games, first formalized by Cleve, H0yer, Toner, and Watrous [15], consti- 
tute the simplest model in which entanglement can be studied quantitatively. In 
an XOR game, two players, Alice and Bob, receive questions u and v (resp.) that 
are picked by a referee according to some probability distribution 7t(m, v) known 
to everybody in advance. Without sharing their questions, the players have to 
answer the referee with bits a and b (resp.), and win the game if and only if the 
exclusive-OR of their answers a ® b equals the value of a Boolean function g(u, v); 
the function g is also known in advance to all three parties. 

In a quantum-mechanical setting, the players determine their answers by per- 
forming measurements on their shares of a pair of entangled quantum systems. A 
state of a pair of d-dimensional quantum systems is a trace- 1 positive semidefinite 
operator p € C^ x<i . The systems are entangled if p is not a convex combination 
of tensor products of d-by-d positive semidefinite matrices. For each question u, 
Alice has a two-outcome measurement defined by a pair of d-by-d positive semi- 
definite matrices that satisfies A" + A\ = I, where / is the identity 
matrix. Bob has a similar pair {B®,Bl} for each question v. When the players 
perform their measurements, the probability that they obtain bits a and b is given 
byTr(A a u ®B b vP ). 

The case d — 1 corresponds to a classical setting. In this case, the maximum 
winning probability equals (l + SDPi(G, A))/2, where G is the complete bipar- 
tite graph with Alice and Bob's questions on opposite sides of the partition, and 
A(u,v) = (—1) 9 ( u ' vS> tt(u,v)/2 for pairs {u,v} <E E and A(u,v) — everywhere else. 

Tsirel'son |46j related the maximum winning probability u>^(TT,g) of the game 
(it, g), when the players are restricted to measurements on d-dimensional quantum 
systems, to the quantity SDP r (G, A). In particular, he proved that 

l + SDP [logrfJ (G,A) l + SDP 2d (G,A) 
2 < w d (?r,g) < - . 

The quantity SDP,.(G, A) thus gives bounds on the maximum winning probability 
of XOR games when players are limited in the amount of entanglement they are 
allowed to use. The rank-r Grothendieck constant K(r, G) of the bipartite graph G 
described above gives a quantitative bound on the advantage that unbounded en- 
tanglement gives over finite entanglement in XOR games. 

1.4. Our results and methods. The purpose of this paper is to prove explicit 
upper bounds for K(r,G). We are especially interested in the case of small r 
and graphs with small chromatic number, although our methods are not restricted 
to this. The proof of the following theorem gives a randomized polynomial-time 
approximation algorithm for approximating ground states in the Heisenberg model 
in the lattice Z 3 with approximation ratio 0.78 . . . = (1.28 . . This result can 
be regarded as one of the main contributions of this paper. 
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Theorem 1.1. For r = 1, . . . , 10 and in the case of a bipartite or a tripartite 
graph G the rank-r Grothendieck constant is at most: 



r 


bipartite G 


tripartite G 


1 


1.782213... 


3.264251... 


2 


1.404909. . . 


2.621596. . . 


3 


1.280812... 


2.412700... 


4 


1.216786... 


2.309224... 


5 


1.177179... 


2.247399 . . . 


6 


1.150060... 


2.206258... 


7 


1.130249... 


2.176891... 


8 


1.115110... 


2.154868... 


9 


1.103150... 


2.137736... 


10 


1.093456... 


2.124024... 



The bound for the original Grothendieck constant K(1,G) for bipartite G is 
due to Krivine [31] . For more than thirty years this was the best known upper 
bound, and it was conjectured by many to be optimal. However, shortly after 
our work appeared in preprint form, Braverman, Makarychev, Makarychev and 
Naor [lOj showed that Krivine's bound can be slightly improved. The best known 
lower bound is 1.676956 . . . due to Davie [TB] and Reeds [12] (see also Khot and 
O'Donnell [25]). The bound for K(2,G) is due to Haagerup [23]. 

When the graph G has large chromatic number, then the result of Alon, Maka- 
rychev, Makarychev, and Naor [2] gives the best known bounds for K(l, G): They 
prove a logarithmic dependence on the chromatic number of the graph (actually on 
the theta number of the complement of G, cf. Section[4]) whereas our methods only 
give a linear dependence. Although our main focus is on small chromatic numbers, 
for completeness we extend the results of [2] for large chromatic numbers to r > 2 
in Section[B] In a previous paper [T3J we proved that K(r, K n ,n) = 1 + Q(l/r). 

For the proof of Theorem 11.11 we use the framework of Krivine and Haagerup 
which we explain below. Our main technical contributions are a matrix version of 
Grothendieck's identity (Lemma 12.11) and a method to construct new unit vectors 
which can also deal with nonbipartite graphs (Lemma 14. ip . 

The strategy of Haagerup and Krivine is based on the following embedding 
lemma: 

Lemma 1.2. Let G = (V,E) be a graph and choose Z = (Zij) G R rx l v at random 
so that each entry is distributed independently according to the normal distribution 
with mean and variance 1, that is, Zij ~ -/V(0, 1). 

Given f:V—> S^' 1 , there is a function g: V —> S> v \ 1 such that whenever u 
and v are adjacent in G, then 

Zg(u) Zg(v) 



E 



= [3(r,G)f(u)-f(v) 



\Zg(u)\\ \\Zg(v)\ 
for some constant fj(r, G) depending only on r and G. 

In the statement above we are vague regarding the constant f3(r,G). We will 
give the precise statement of the lemma in Section |4] (cf. Lemma 14.11 there) , right 
now this precise statement is not relevant to our discussion. 
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Now, the strategy of Haagerup and Krivine amounts to analyzing the follow- 
ing four-step procedure that yields a randomized polynomial-time approximation 
algorithm for SDP r (G, A): 

Algorithm A. Takes as input a finite graph G = (V, E) with at least one edge and 
a symmetric matrix A: V x V — > M, and returns a feasible solution h:V—> S*' -1 
of SDP r (G,A). 

(1) Solve SDP oc (G, A), obtaining an optimal solution /: V -> S^^ 1 . 

(2) Use / to construct g: V — > according to Lemma [L2l 

(3) Choose Z = (Zij) G K ,x l y l at random so that every matrix entry Zy is 
distributed independently according to the standard normal distribution 
with mean and variance 1, that is, ~ N(0, 1). 

(4) Define h: V -> S^ 1 by setting h(u) = Zg(u)/\\Zg(u)\\. 

To analyze this procedure, we compute the expected value of the feasible solu- 
tion h. Using Lemma fOl we obtain 

SDP r (G, A) > E A ( u ' V M U ) ■ h ( v ) 

{u,v}£E 

= A{u,v)E[h(u) ■ h(v)} 

( 2 ) {u,v}EE 

= P{r,G) A(u,v)f(u)-f(v) 

{u,v}eE 
= P{r,G) SDP 00 (G,A), 

and so we have K(r, G) < j3(r, G) _1 . 

If we were to skip step (2) and apply step (4) to / directly, then the expecta- 
tion ¥,[h(u) ■ h(v)] would be a non-linear function of f(u) ■ f(v), which would make 
it difficult to assess the quality of the feasible solution h. The purpose of step (2) is 
to linearize this expectation, which allows us to estimate the quality of h in terms 
of a linear function of SDP r (G, A). 

The constant /3(r, G) in Lemma ll.2l is defined in terms of the Taylor expansion 
of the inverse of the function E r : [— 1, 1] — > [— 1, 1] given by 



E r {x ■ y) = E 



Zx Zy 



\Zx\\ \\Zy\\ 



where x, y £ S°° and Z = (Zij) £ R rxo ° is chosen so that its entries are indepen- 
dently distributed according to the normal distribution with mean and variance 1. 
The function E r is well-defined since the expectation above is invariant under or- 
thogonal transformations. 

The Taylor expansion of E r is computed in Section [5] The Taylor expansion 
of E'jT 1 is treated in Section [31 where we basically follow Haagerup [23 . A precise 
version of Lemma 11.21 is stated and proved in Section [U following Krivine [3]] • 

Finally, in Section [S] we show that one can refine this analysis and can (strictly) 
improve the upper bound if one takes the dimension of the matrix A: V x V — >• R 
into account. In particular, we compare the problems SDP g and SDP r for q > 
r. Earlier, Avidor and Zwick [7] considered the problem of bounding the ratio 
SDP 9 (G, A)/ SDPi(G, A) for q = 2, 3 and A the Laplacian matrix of a graph. 
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2. A MATRIX VERSION OF GROTHENDIECK'S IDENTITY 



In the analysis of many approximation algorithms that use semidefinite pro- 
gramming the following identity plays a central role: Let u, v be unit (column) 
vectors in R" and let Z 6 R lxn be a random (row) vector whose entries are dis- 
tributed independently according to the standard normal distribution with mean 
and variance 1. Then, 



E[sign(Zu) sign(Zv)] = E 



Zu 
\Zu\ 



Zv 
\Zv\\ 



— arcsin(u • v). 



For instance, the celebrated algorithm of Goemans and Williamson [21] for 
approximating the MAX CUT problem is based on this. The identity is called 
Grothendieck's identity since it appeared for the first time in Grothendieck's work 
on the metric theory of tensor products [221 Proposition 4, p. 63] (see also Diestel, 
Fourie, and Swart [T7]). 

In this section we extend Grothendieck's identity from vectors to matrices by 
replacing the arcsine function by a hypergeometric function. 

Lemma 2.1. Let u, v be unit vectors in M n and let Z £ R rx " be a random ma- 
trix whose entries are distributed independently according to the standard normal 
distribution with mean and variance 1. Then, 

2 



E 



Zu 

izuii 



Zv 



Zv 



r((r + i)/2; 

r(r/2) 



(u ■ v) 2-Fl 



1/2,1/2 
r/2 



I ; (u ■ v)^ 



Here, 



(u ■ v) 2-Fi 



1/2,1/2 
r/2 + 1 



E 

k=0 



(u-vf 
(1-3- 



{2k-l)f 



(2 • 4 • • • 2k)((r + 2) • (r + 4) • • • (r + 2k)) 



(u ■ v) 



2k+l 



Before proving the lemma we review special cases known in the literature. If 
r = 1, then we get the original Grothendieck's identity: 

2 



E[sign(Zu) sign(Zv)] 



■ arcsin(u • v 

TT 

2 

— | u • v 

TT 



(u ■ v) 3 



1 • 3 
2~ 4 



(u • v) 5 



The case r 
Zu 

E 



2 is due to Haagerup 



Zv 



Zu 



Zv 



1 



(E(u ■ v) - (1 - (u ■ vf )K{u ■ v)) 



TT 

4 



u ■ V + 



(u ■ v) 



1 ■ 3 
2~4 



(u • v) 1 



where K and i5 are the complete elliptic integrals of the first and second kind. 
Note that on page 201 of Haagerup [23 it/ 2 has to be tt/A. Briet, Oliveira, and 
Vallentin [12] computed the first coefficient 2/r(F((r + l)/2)/r(r/2)) 2 of the Taylor 
series of the expectation for every r. 

The following elegant proof of Grothendieck's identity has become a classic: We 
have sign(-Zu) sign(Zw) = 1 if and only if the vectors u and v lie on the same 
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side of the hyperplane orthogonal to the vector Z £ M. lxn . Now we project this n- 
dimensional situation to the plane spanned by u and v. Then the projected random 
hyperplane becomes a random line. This random line is distributed according to the 
uniform probability measure on the unit circle because Z is normally distributed. 
Now one obtains the final result by measuring intervals on the unit circle: The 
probability that u and v lie on the same side of the line is 1 — arccos(w • v)/n. 

However, we do not have such a picture proof for our matrix version. Our proof 
is based on the rotational invariance of the normal distribution and integration with 
respect to spherical coordinates together with some identities for hypergeometric 
functions. A similar calculation was done by Konig and Tomczak-Jaegermann [30) . 
It would be interesting to find a more geometrical proof of the lemma. 

For computing the first coefficient of the Taylor series in [12] we took a slightly 
different route: We integrated using the Wishart distribution of 2 x 2-matrices. 



Proof of Lemma \2.1\ Let Zi £ 
We define vectors 

(Zx ■ u\ 

Z 2 ■ u 

x 



be the i-th row of the matrix Z , with i = 1, . 



and 



Z 2 - 



y = 



\Z r ■ uj \Z r ■ v J 

so that we have x-y — (Zu)-(Zv). Since the probability distribution of the vectors Zi 
is invariant under orthogonal transformations we may assume that u = (1, 0, . . . , 0) 
and v — (t, vl — t 2 , 0, . . . , 0) and so the pair (x, y) £ WxW is distributed according 
to the probability density function (see e.g. Muirhead 07j p. 10, eq. (7)]) 



(27r^l-t 2 )- 



exp 



x ■ x — 2tx ■ y + y ■ y 
2(I~^j 



Hence, 



x 



y 



■ exp 



x ■ x — 2tx ■ y + y ■ y 



dxdy. 



WW \ 2(1 -£ 2 ) 

By using spherical coordinates x — a£, y = /3rj, where a,/3 £ [0, oo) and £, rj £ 5 r_1 , 
we rewrite the above integral as 

a 



OO pOO 



(a/3) 



r-l 



exp 



2 , 01 



2(1 -t>) 



s r ~ 



£ ■ rj exp 



a/3t£ ■ r\ 
1-t 2 
dui(t;)dui(r])dad(3. 



If r = 1, we get for the inner double integral 

' a(3t^ ■ rj 
IF 



£ • V exp 

s° Js° \ 1 



du)(£)dijj(ri) 



= 4 sinh 



a/3t 
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Now we consider the case when r > 2. Since the inner double integral over the 
spheres only depends on the inner product p = £ • 77 it can be rewritten as 

^S'-XS'- 1 ) I' pexp ( (l- P ^-^dp 



l-t 2 



where 

4^-1/2 

1 ' { ' r(r/2)r((r-l)/2) 
Integration by parts yields 



(r-l)(l-i 2 ) ' " ' ' ■■ M ' 1 1 " 

The last integral can be rewritten using the modified Bessel function of the first 
kind (cf. Andrews, Askey, Roy [2J p. 235, Exercise 9]) 



1 f-i 2s(r-l)/2 ( \ 



dp 



'2(l-i 2 )\ r T ( apt 
r((r + 1)/2)V^F ( ] / r/a ' 



apt ) rjA \\-t 2 

One can write I r / 2 as a hypergeometric function (cf. Andrews, Askey, and Roy [4j 
(4.12.2)]) 

i . (x)-(x/2Y/*^r {x,2)2k {x/2)r/2 F ( ~ 

i r/2 (X)-[x/z) ^ k \T(r/2 + k+l) r((r + 2)/2)° 1 \(r + 2)/2 ' V2 
Putting things together, we get 

47r r oi^f _, / / a0t 1 " 



r(r/2) 2 r 1 -t 2 ° Fl ^(r + 2)/2 ' ^2(l-< 2 ) 

Notice that the last formula also holds for r = 1. So we can continue without case 
distinction. 

Now we evaluate the outer double integral 

00 r °° / a 2 +p 2 \ _ / f apt ;i 



W ex P ( -^—^2) J 0^1 I (r + 2 )/2 ; (, 5(1^) ) I dadfi. 



Here the inner integral equals 



CO / 9 







'' <X1,( -2(T^J oFl ((r + 2)/2 ; (2Tr=^J ! ' /n - 



and doing the substitution 7 = a 2 / (2(1 — t 2 )) gives 

2 <,-.v 3(1 _ t ^ + .,/^% (r - W2exp( _ 7)oFl ( (r - /2i _2»_) „, 
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which is by the Bateman Manuscript Project [El p. 337 (11)] equal to 

2 (r-l)/2 fl _ i 2 ) (r+l)/2 rff +1 w 2 N ^ ( {t + l)/2 , W 

Now we treat the remaining outer integral in a similar way, using [19[ p. 219 (17)], 
and get that 

io " eX H _ ^^J lFl l(^ + 2)/2'2(l-^)J d/3 

= 2<- 1 >/ 2 (l - ^)M/»r((r + l)/^ ( (r + ^ (r + l)/2 . ^ 

By applying Euler's transformation (cf. Andrews, Askey, and Roy [U (2.2.7)]) 

f(r + l)/2, (r + l)/2 . 2 \ _ , . 2 ,_ r/2 / 1/2, 1/2 . 2 \ 
2Fl (, (r + 2)/2 ,*J-(l-0 * F i[ {r + 2 )/2> t ) 

and after collecting the remaining factors we arrive at the result. □ 



3. Convergence radius 

To construct the new vectors in the third step of the algorithm that are used 
to linearize the expectation we will make use of the Taylor series expansion of the 
inverse of E r . Locally around zero we can expand the function E" 1 as 



,2fc+l 

'2fc+l 
fc=0 

but in the proof of Lemma 14.11 it will be essential that this expansion be valid for 



of of Lemma [ 

allte [-1,1]. 

In the case r = 1 we have E'-f x (t) = sin(7r/2t) and here the convergence radius 
is even infinity. The case r = 2 was treated by Haagerup and it requires quite 
some technical work which we sketch very briefly now. He shows that \bk\ < C/k 2 
for some constant C, independent of k, using tools from complex analysis. Using 
Cauchy's integral formula and after doing some simplifications p. 208] one can 
express bk as 

&* = 4" / %(E 2 (z)- k )dz + ^-%[ I E 2 (z)- k dz 



71 A" 



J\{E 2 {z)- k )dz + ^ ^ 



where C' a is the quarter circle { ae l9 : 6 € [0, 7i/2] }. 

For an appropriate choice of a the first integral is in absolute value bounded 
above by C/k and the second integral is in absolute value exponentially small in A;. 
We refer to the original paper for the details. One key point in the arguments is 
the following integral representation of E 2 giving an analytic continuation of E 2 on 
the complex plane slit along the half line (1, oo): 

,7T/2 

E 2 (z) = / sin 8 arcsin(2: sin6>) dQ. 
Jo 

Here, the term arcsin(zsin#) gives the main contribution in the estimates. 

Now we derive a similar representation of E r and using it in Haagerup's analysis 
with obvious changes shows that also for r > 2 we have bk < C/k 2 for some constant 
C, independent of k. 
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Lemma 3.1. For r > 2 we have 



2(r-l)r((r + l)/2) f^ 2 ,_ a , 



E r (z) = / cos r 6» sin arcsin(z sin 6)d6. 

r(l/2)r(r/2) J 

Proof. Using Euler's integral representation of the hypergeometric function (cf. An- 
drews, Askey, and Roy [4j Theorem 2.2.1]) we can rewrite E r as 

r((r + i)/2) r 1 (i-t)<r-Wz 
KKz) r(i/2)r(r/2) k ^iy^zH) 

which is valid in the complex plane slit along the half line (l,oo). Using the sub- 
stitution t = sin 2 9 we get 



r(l/2)r(r/2) J ^i_ z 2 sin 2 
Now integration by parts and the identity 

d . f . , zcosg 
arcsm(z smt 



^ VI - z 2 sin 2 6» 

gives the result. □ 



4. Constructing new vectors 

In this section we use the Taylor expansion of the inverse of the function E r to 
give a precise statement and proof of Lemma 11.21 this is done in Lemma 14.11 For 
this we follow Krivine [21] , who proved the statement of the lemma in the case of 
bipartite graphs. We comment on how his ideas are related to our construction, 
which can also deal with nonbipartite graphs, after we prove the lemma. 

For the nonbipartite case we need to use the theta number, which is a graph 
parameter introduced by Lovasz [34]. Let G = (V,E) be a graph. The theta 
number of the complement of G, denoted by #((?), is the optimal value of the 
following semidefinite program: 

0(5) = mini A: Z eR^ v , 

(3) Z(u, u) = A - 1 for u £ V, 

Z(u,v) = -1 for {u,v}€E\. 

It is known that the theta number of the complement of G provides a lower bound 
for the chromatic number of G. This can be easily seen as follows. Any proper 
fc-coloring of G defines a mapping of V to the vertices of a (k — l)-dimensional 
regular simplex whose vertices lie in a sphere of radius y/k — 1: Vertices in the 
graph having the same color are sent to the same vertex in the regular simplex and 
vertices of different colors are sent to different vertices in the regular simplex. The 
Gram matrix of these vectors gives a feasible solution of ©. 

Lemma 4.1. Let G = (V, E) be a graph with at least one edge. Given /: V — > 
S^l- 1 , there is a function g : V — > S^ v ^ 1 such that whenever u and v are adjacent, 
then 

E r {g{u)-g{v))=f3{r 1 G)f{u)-f(v). 
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The constant /3(r, G) is defined as the solution of the equation 

co 1 

where 

oo 
k=Q 

With this lemma, we can give a proof of Theorem II. II 

Proof of Theorem ll.il We combine Lemma I4.ll with the analysis of Algorithm A 
from Section fl .41 To compute the table in the theorem, we use the formula 

t=o 

where dj are the Taylor coefficients of E r (cf. Morse and Feshbach [36j (4.5.13)]). □ 



(4) b, ' 



k\a\ 



Now we give a proof of the lemma. 

Proof of Lemma \4-1\ We construct the vectors g(u) £ S 1 '^'" 1 by constructing vec- 
tors R(u) in an infinite-dimensional Hilbert space whose inner product matrix co- 
incides with the one of the g(u). We do this in three steps. 
In the first step, set H = Rl y and consider the Hilbert space 

CO 

fc=0 

For a unit vector x € H , consider the vectors S(x), T(x) £ H given componentwise 

by 

S(x) k = ^\b 2k+1 \l3(r,G)^x^ 2k+1 ^ 

and 

T(x) k = sign(b 2k+1 )^J\b 2k+1 \fi(r, G) 2k+1 x® { - 2k+1 '> . 
Then for vectors x,y £ S^^ 1 we have 

S(x)-T(y) = E- 1 (p(r,G)x-y) 

and moreover 

oo ^ 

S(x) ■ S(x) = T(x) ■ T(x) = J2 \b2 k+ x\P(r, G) 2k+1 = 

fe=0 ~ 1 

In the second step, let A = i?(G), and Z be an optimal solution of ((3|). We 
have A > 2 since G has at least one edge. Set 

(A-1)(J + Z) (A-l)J-Z 

A= 2A and B= 2A ' 

and consider the matrix 

By applying a Hadamard transformation 

-f 7 7 V-f 7 I ) = ( A+B 
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one sees that U is positive scmidefinite, since both A + B and A — B are positive 
semidefmite. Define s: V -» K 2|y| and t: V -> R 2 I V I so that 

s(it) ■ s(u) = t(u) ■ t(v) = A(u, v) and s(u) ■ t(v) = B(u, v). 

The matrix U is the Gram matrix of the vectors (s(uj) y and (t(v)J y . It 
follows that these maps have the following properties: 

(1) s(u) ■ t(u) = for all u £ V, _ 

(2) s(u) ■ s(u) = t(u) ■ t(u) = (i)(G) - l)/2 for all u£V, 

(3) s(u) • s(v) = t{u) ■ t{v) = whenever {u,v} £ E, 

(4) s(u) ■ t(v) = s(v) ■ t(u) — 1/2 whenever {u, v} £ E. 

In the third step we combine the previous two. We define the vectors 

R{u) = s{u) <g) S(f(u)) + t(u) ® T(f(u)). 

For adjacent vertices u,v £ V we have 

R(u)-R(v)=E- 1 (f3(r,G)f(u) •/(«)), 

and moreover the are unit vectors. Hence, one can use the Cholesky decom- 

position of (R(u)-R(v)) £ R^ v to define the desired function g: V -> □ 

We conclude this section with a few remarks on the lemma and its proof: 

(1) To approximate the Gram matrix (R(u) ■ R(v)) it is enough to compute the 
series expansion of E^ 1 and the matrix U to the desired precision. The 
latter is found by solving a semidefmite program. 

(2) Krivine proved the statement of the lemma in the case r — 1 and for 
bipartite graphs G; then, i?(G) = 2 holds. Here, one only needs the first 
step of the proof. Also, /3(1,G) can be computed analytically. We have 
E^ x {t) = sin(vr/2t) and 

OO 

E 

k=0 

Hence, 0(1, G) = 2 arcsinh(l)/7r = 2 ln(l + V2)/n. 

(3) In the second step one can also work with any feasible solution of the 
semidefmite program @. For instance one can replace $(G) in the lemma 
by the chromatic number x(G) albeit getting a potentially weaker bound. 

(4) Alon, Makarychev, Makarychev, and Naor [2 also gave an upper bound for 
K(l, G) using the theta number of the complement of G. They prove that 

if(l,G)<0(logtf(G)), 

which is much better than our result in the case of large "d(G). However, 
our bound is favourable when i?(G) is small. In Section |6] we generalize the 
methods of Alon, Makarychev, Makarychev, and Naor [5] to obtain better 
upper bounds on K(r, G) for r > 2 and large #(G). 

(5) Finally, notice that in the first step it was essential that the Taylor expan- 
sion of E~ x has convergence radius of at least one. 



(-1) 



2fc+l 



(tt/2) 



2/c+l 



(2fc + l)! 



2fc+l 



= sinh(7r/2i). 
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5. A REFINED, DIMENSION-DEPENDENT ANALYSIS 

So far we only compared the two problems SDPoo and SDP r . One can perform a 
refined, dimension-dependent analysis by comparing SDP g and SDP r when q > r. 
Let K(q i — y r,G), where q > r, be the least number such that 

SDP,(G, A) < K(q h> r, G) SDP r (G, A) 

for all A: V x V — > R. In this section we give an upper bound for A(q H> r, G) that 
depends on q and r. For fixed r, this upper bound will become smaller as q comes 
closer to r. Krivine |31j gave such a refined, dimension-dependent analysis in the 
bipartite case; he showed that 

K(2 i y 1, K nin ) = a/2, K(3 i-> 1, K n>n ) < 1.517, and K(4 h> 1, K n , n ) < it/2. 

Avidor and Zwick [7] analyzed the cases r = 1 and q € {2, 3} for bipartite G — K n n 
and A the Laplacian of a graph G' on n nodes. 

Our upper bound comes from the following lemma: 

Lemma 5.1. LetG = (V,E) be a graph with at least one edge. Given f : V — > 
there is a function g: V — > S*!^ -1 smc/i i/iai whenever u and v are adjacent, then 

E r (g{u) ■ g{v)) = 0(q h+ r, G)/(«) ■ /(«), 

where < /3(g i-> r, G) < 1 is such that /3(q H> r, G) > /3(g + l^r,G) and /3(q i-> 
r,G) > /3(r,G) for all q>2. 

The proof of the lemma will also give a procedure to compute (3(q i— > r, G) explicitly. 
So we have the theorem: 

Theorem 5.2. Let G = (V, E 1 ) be a graph with at least one edge and let q > r > 1 
be integers. Then K(q t— > r, G) < i— > r, G) . 

Proof. Combine Lemma 15.11 with Algorithm A from Section 11.41 □ 

The proof of the lemma uses some basic facts from harmonic analysis, which we 
now summarize. For measurable functions /, g: [—1, 1] — > M we consider the inner 
product 

(5) (/,<?>„ = j\t)g(t)(l-t 2 )^/ 2 dt. 

We say that a continuous function /: [— 1, 1] — > M is of positive type for S™^ 1 if for 
any choice xi, . . . , xm of points in we have that the matrix (f(xi ■ Xj)) { ._ x is 
positive semidefinite. If two continuous functions /, g: [—1, 1] — > R are of positive 
type for S n ~\ then (/,(?)„ >0. 

Schoenberg [44j characterized the continuous functions of positive type in terms 
of Gegenbauer polynomials. We denote by P£ the Gegenbauer polynomial of de- 
gree k and parameter (n— 2)/2 which is normalized so that PJ}(1) = 1. Notice that 
this normalization is not the one commonly found in the literature. 

The Gegenbauer polynomials Pq, P", P£, ■ ■ ■ are pairwise orthogonal with 
respect to the inner product ([5]), and they form a complete orthogonal system for 
the space L 2 ([— 1, 1]), equipped with the inner product (|5|). 
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Schoenberg's characterization of the functions of positive type is as follows: A 
function /: [— 1,1] — > M is continuous and of positive type for S"" -1 if and only if 

oo 

(6) /(*)=E a * P fc"(*) 

k=0 

for some nonnegative numbers do, di, 0,2, ■ ■ ■ such that X)fc=o ak converges, in which 
case the series in (j6]) converges absolutely and uniformly in [—1,1]. 

A continuous function / : [— 1 , 1] —> R can also be of positive type for spheres of 
every dimension. Schoenberg [44] also characterized these functions. They are the 
ones that can be decomposed as 

00 

fc=0 

for some nonnegative numbers clq, ai, 02, ... such that X)fc=o a k converges. 

A polynomial in M[xi, . . . , x n ] is harmonic if it is homogeneous and vanishes 
under the Laplace operator A = d 2 jdx\ + • • • + d 2 /dx 2 l . Harmonic polynomials 
restricted to the unit sphere S" 1-1 are related to Gegenbauer polynomials by the 
addition formula (see e.g. Andrews, Askey, and Roy [H Theorem 9.6.3]): Let Hk 
be the space of degree k harmonic polynomials on n variables. Any orthonormal 
basis of Hk can be scaled so as to give a basis e^.i, . . . , ek,h k of Hk for which the 
following holds: For every u, v e S 71 ^ 1 we have that 

hk 

Pk(u -v) e k<i (u)e k ,i(v). 
i=i 

With this we have all that we need to prove the lemma. We only consider 
the bipartite case in the proof in order to simplify the notation and to make the 
argument more transparent. One can handle the nonbipartite case exactly in the 
same way as in the proof of Lemma 14.11 

Proof of Lemma \5.1[ As before, we construct the function g : V — > 5 f ' y ' _1 from 
functions S and T that satisfy S(x) ■ T(y) = E^ 1 (f3x ■ y) for some real number j3. 
Fix < j3 < 1 and consider the expansion 

CO 

E-\pt)=Y,9 q Mpm, 

fc=0 

which converges in the L 2 sense. 

Claim 1. The series X)fc=o ISfeO-OI converges for every < (3 < 1, and hence the 
above expansion converges absolutely and uniformly for t £ [—1, 1]. 

Proof: To prove the claim we use the comparison test. To this end, consider the 
expansion E^ 1 ^) = J2kLo ^kt k and recall that X)fc=o l^ fe l converges. We may define 
the function E r (t) = J2T=o \bk\t k , which is then of positive type for every sphere. 
So by Schoenberg's theorem we can write 

00 

fe=0 
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for nonnegative numbers g| such that X^fcLo 5fc converges. Now notice that 

oo 

9iw = wnv^m^ = iini.~ 2 E^> p *>*> 

1=0 

where H-P^ll? = (P% , P'kUi 2 ■ Above, since t l is a function of positive type for every 
sphere, we have that (t l ,P k ) q > 0. But we also have that 

oo 

gl = \\p q k \\- 2 (E;\p q k ) q = \\p q k \\fJ2M tl ' p ^ 

1=0 

and we see that |ff'(/3)| < g q k for all k > and < (3 < 1. This proves the claim. ♦ 
From the proof of the claim, it it also clear that 

oo 

(7) £W)l 

fc=0 

is a continuous function of /?. 

Now, let f3(q H> r, G) be the maximum number f3 € (0, 1] that is such that 

oo 

E ls2C0l = i- 

By the intermediate value theorem, such a number exists because JT]) is continuous 
as a function of /3, being equal to when j3 = and at least = 1 when /? = 1. 

Consider the Hilbert space 

oo 

ft = 0R hfc , 

fc=0 

equipped with the Euclidean inner product and where h k is the dimension of H k , 
the space of harmonic polynomials of degree k on q variables. For a vector x € S" 9 " 1 , 
consider the vectors S(x) and T(x) € W given componentwise by 

S[x) k = \J\gl(P{q r, G))\(e k ,i(x), e kthk (x)) and 

T(x) k = S ign(gl(f3(q ^ r, G)))yJ\^Kq H- r, G))\ (e M (x), e kM {x)). 

By the addition formula we have that 

S(f(u)) ■ T(f(v)) = E- l ((3(q ^ r, G)f(u) ■ /(«)). 

Moreover, we also have that 

oo 

\\S(f(u))f = \\T(f(v))r = \9im -> r, G))\ = 1, 

k=0 

and so from the Gram matrix of the vectors S(f(u)) and T(f(v)) we may obtain 
the function g: V — >5' y ' _1 aswe wanted. 

Now we show that /3(q i-> r,G) > 0(q + 1 i-> r, G) for all q > 2. To this end, 
consider the function 

oo 

^(*) = Ei^ +1 (- 8 )i p fc +1 (*)- 
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Since the series J^fcLo \dt +1 iP) I converges, from Schoenberg's theorem we see that Fp 
is a continuous function of positive type for the sphere S q . Notice moreover that, by 
definition, (3{q + 1 i— > r, G) is the maximum number j3 G (0, 1] such that Fp(l) = 1. 

Since Fp is of positive type for S q , it is also of positive type for S" 3-1 , and then 
we may write 



k=0 

and we have J2T=o a k{P) = 1 as for all k, Pj*(l) = 1- We also have the expression 

CO 

(8) am = \\P!\\- 2 (F,,P!) q = \\p!\\- 2 y, \9t +1 m{p? + \pi\ 



II- 

1=0 



Notice that, since both P* +1 and P% are of positive type for S q 1 , we have that 



(P q+1 ,P q ) q > for all I and k. 
Now, from the expansion 



k=0 

we see that 



}9\ 

k/i- 



(9) 9 m = \\Pl\\-\E-\ptlPl) q = ||^||- 2 ^ 5 ? +1 (/3)(^ +1 ,^ 

1=0 

The function E^ 1 is not of positive type because the coefficient 63 of its Taylor 
expansion is always negative (this can easily be checked using Eq. (j4])), and so 
some of the gf + (/3) must be negative. This, together with (JSJ) and ©, implies 
that < afe(/3) for all < /3 < 1. So we must have that 

00 00 

E \9 q Ml + 1 G))\ < a k ((3(q + 1 h> r, G)) = 1, 

k=0 k=0 

and we see that (5{q r, G) > j3{q + 1 i-> r, G). In a similar way, one may show 
that P(q h> r,G) > 0(r,G). □ 

6. Better bounds for large chromatic numbers 

For graphs with large chromatic number, or more precisely with large $(G), our 
bounds on K(r, G) proved above can be improved using the techniques of Alon, 
Makarychev, Makarychev, and Naor [2]. In this section, we show how their bounds 
on K(l, G) generalize to higher values of r. 

Theorem 6.1. Given a graph G = (V,E) and integer 1 < r < logi9(G') 7 we have 

K(r,G)< (^y 

Proof. It suffices to show that for any matrix A : V x V — > R, we have 

SDP,,.(G,A) > n ( - ) SDP QO (G,A). 
Vlog^(G)/ 
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Fix a matrix A : V x V -> M. Let / : V -> S^l -1 be optimal for SDP oc (G, A), 
so that 

A(u,v)f(u)-f(v) = SDP 00 (G,A). 

{u,v}eE 

Let A = d(G), and Z : V x K — >• K be an optimal solution of fl3J). Let J be 
the 2\V\ x 2|V| all-ones matrix and I the 2x2 identity matrix. Since the matrix 
(I ® Z + J)/\ is positive semidcfinite, we obtain from its Gram decomposition 
functions s,t : V — > M. 2 \ v \ that satisfy 

(1) s(u) ■ s(u) = t(u) ■ t(u) = 1 for all ueV. 

(2) s(u) ■ t(u) = 1/A for all u, u G 7. 

(3) s(u) ■ s(v) = t(u) ■ t{v) = for all {u,v} e E. 

Let % be the Hilbert space of vector-valued functions h : R rx l v l — > R r with 
inner product 

(g,h)=E[g(Z).h(Z)} ) 

where the expectation is taken over random r x |V| matrices Z whose entries are 
i.i.d. N(0, 1/r) random variables. 

Let R > 2 be some real number to be set later. Define for every u € V the 
function g u 6 "H by 



R 



if \\Zf(u)\\<R 
otherwise. 



Notice that for every matrix Z e R rx l y l, the vector g u {Z) e M r has Euclidean 
norm at most 1. It follows by linearity of expectation that 



SDP r (G,,4) > E 



^2 A ( u ^ v )9u(Z) ■ g v {Z) = ^2 A ( u > v )(9u,gv) 

■{u,v}eE J {u,v}EE 

We proceed by lower bounding the right-hand side of the above inequality. 
Based on the definition of g u we define two functions h\ € H by 

Zf(u) 



h° u (Z) = ^P- + 9u (Z) 



and 



R 



9u{Z)- 



For every u g V, define the function H u € R 2 l y l ® % by 

ff„ = -s(u) ® /i° + 2A*(m) ® /i*. 

We expand the inner products (g u ,gv) in terms of /(w) • /(i>) and (H U ,H V 
Claim 2. For every {m, t>} G -B me ftawe 



{9u,< 



1 

i?2 



f(u)-f(v)-(H u ,H v ). 



Proof: Simply expanding the inner product (H U ,H V ) gives 



(H U ,H V ) 



Siu) - s{v) -(hlh° v )+4\i(t(u)-t(v)) (hlhl) 



16 

A 
2 



»(«)•*(«)) + (t(u)- S (v))(hlh° v ) 
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It follows from property 3 of s and t that the above terms involving s(u) ■ s(v) and 
t(u) ■ t[v) vanish. By property 2, the remaining terms reduce to 



\{{hlX) + {h l u ,hl))=\ 



E 



Zf(u) 
R 



9u{Z) 



( Zfjv) 



9v(Z) 



-E 



Zf(u) 
R 



Expanding the first expectation gives 



1 

R 2 



E[f(uy Z l Zf(v)]- (g»,g v ) -E 



Zf(u) 
R 



and expanding the second gives 



1 



jpE[f{uyZ*Zf{v)]-{g Ul g v )+E 



Zf(u) 



R 



9u{Z) 



9v{Z) 



9v(Z) 



Zf(v) 
R 



-E 



9u{Z) 



9u{Z) 



9v(Z) 

Zfjv) 
R 

Zf(v) 
R 



Adding these two gives that the last two terms cancel. Since E[Z T Z] = I, what 
remains equals 

which proves the claim. ♦ 
From the above claim it follows that 

A(u,v)(g u ,g v ) = -^ST)F 00 (G,A)- £ A(u,v)(H u ,H v ) 



{u,v}£E 



{u,v}€E 



> [^-m & x\\H u \\ 2 )SDP 00 (G,A), 
H uev 



where \\H U \\ 2 = (H u , H u ). 

By the triangle inequality, we have for every u E V. 

2 



Hu\\ 2 < [ 7 \\hl\\ + 2A||^|| < 



1 /l 



R 2 \2 



2XRE 



Zf(u) 



R 



9u{Z) 



1 \ 2 



By the definition of g u , the vectors Zf(u) and g u are parallel. Moreover, they 
are equal if |jZ/(u)|| < R. Since f(u) is a unit vector, the r entries of the random 
vector Zf{u) are i.i.d. iV(0,l/r) random variables. Hence, 

Zf(u) 



E 



R 



9uiZ) 



r \r/2 



-rp 2 /2 



dpduj r (£ ) 



r r/2 



< 



RT( 



p r e- rp2/2 dp, 



where ui r is the unique rotationally invariant measure on S r , normalized such 
that £D r (5 r - 1 ) = W 2 /T(r/2). Using a substitution of variables, we get 



P e 



-rp 2 /2 



l/2\('-+i)/2 /r + 1 rR 



2 ' 2 



where T(a,x) is the lower incomplete Gamma function [4j Eq. (4.4.5)]. 
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Collecting the terms from above then gives the bound 

if (i 9( r+1 )/ 2 /r + 1 rE 2 \\ 2 \ 
(10) SDP^^l-j^ + A-^r^-,— )j jsDP^A). 

The bound in the theorem follows by setting R as small as possible such that the 
above factor between brackets is some positive constant. 

By Stirling's formula, there is a constant C\ > such that T(x) > C\e~ x x x ^ 1 / 2 
holds (see for example Abramowitz and Stegun [TJ Eq. (6.1.37)]). Hence, for some 
constants c, C > 0, we have 

9(r+l)/2 /cW2 

The power series of the incomplete gamma function [TJ Eq. (6.5.32)] gives that 
if a < x, for some constant C2 > 0, the inequality T(a, x) < C2X a e~ x holds. As 
R > 2, for some constants d, D > 0, we have 

fr + l rR 2 \ _/ r W 2 

(12) r (^— J^^W ■ 

Putting together estimates (|TT|) and P^)l gives 



Vfr(§) V 2 ' 2 ; - v \dR 

Since r < log A there is some constant C such that for R 2 = C"(logA)/r, the 
above value is less than 1/4. It follows that for this value of R, Inequality (fTO]) is 
nontrivial and we get the result. □ 
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